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Methods for Genetic Diversification 
in Gene Conversion Active Cells 

The present invention relates to a method for directed and selective genetic diversification of a 
target nucleic acid sequence or gene product by exploiting the relationship between 
immunoglobulin gene conversion and hypermutation in antibody-producing cells, as well as to 
cells and cell lines capable of said genetic diversification. 

Many approaches to the generation of diversity in gene products rely on the generation of a very 
large number of mutants which are then selected using powerful selection technologies. However, 
these systems have a number of disadvantages. If the mutagenesis is done in vitro on gene 
constructs which are subsequently expressed in vitro or as transgenes in cells or animals, the gene 
expression in the physiological context is difficult and the mutant repertoire is fixed in time. If 
mutagenesis is on the other hand performed in living cells, it is difficult to direct mutations to a 
target nucleic acid where they are desired. Therefore the efficiency of isolating molecules with 
improved activity by repeated cycles of mutations and selection with sufficient efficiency is 
limited. Moreover, random mutagenesis in vivo is toxic and likely to induce a high level of 
undesirable secondary mutations. 
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In nature, directed diversification of a selected nucleic acid sequence takes place in the rearranged 
V(D)J segments of the immunoglobulin (Ig) gene loci. The primary repertoire of antibody 
specificities is generated by a process of DNA rearrangement involving the joining of 
immunoglobulin V, D 5 and J gene segments. Following antigen encounter, the rearranged V(D) J 
segments in those B cells, whose surface Ig can bind the antigen with low or moderate affinity, are 
subjected to a second wave of diversification by hypermutation. Thus so-called somatic 
hypermutation generates the secondary repertoire from which increased binding specificities are 
selected thereby allowing affinity maturation of the humoral immune response (Milstein and Rada, 
1995). • 



The mouse and man immunoglobulin loci contain large pools of V, D and J gene segments which 
can participate in the V(D)J rearrangement, so that significant diversity is created at this stage by 
random combination. Other species such as chicken, rabbit, cow, sheep and pig employ a different 
strategy to develop their primary Ig repertoire (Butler, 1998). After the rearrangement of a single 
functional V and J segment, further diversification of the chicken light chain gene occurs by gene 
conversion in a specialized lymphoid organ, the Bursa of Fabricius (Reynaud et al., 1987; 
Arakawa and Buerstedde, in press). During this process, stretches of* sequences from non- 
functional pseudo-V-genes are transferred into the rearranged V-gene. The twenty-five pseudo-V- 
genes are situated upstream of the functional V-gene and share sequence homology with the V- 
gene. Similar to the situation in men and mice, affinity maturation after antigen encounter takes 
place by hypermutation in the splenic germinal centers of the chicken (Arakawa et al., 1996). 

All three B cell specific activities of Ig repertoire formation - gene conversion (Arakawa et al, 
2002), hypermutation and isotype switch recombination (Muramatsu et al., 2000; Revy et al, 
2000) - require expression of the Activation Induced Deaminase (ADD) gene. Whereas it was 
initially proposed that AID is a DNA editing enzyme (Muramatsu et al., 1999), more recent studies 
indicate that AID directly modifies DNA by deamination of cytosine to uracil (Di Noia and 
Neuberger, 2002), However, the cytosine deamination activity must be further regulated, because 
only differences in the type, the location and the processing of the AED-induced DNA 
modification can explain the selective occurrence of recombination or hypermutation in different 
species and B cell environments. Based on the finding that certain AED mutations affect switch 
recombination, but not somatic hypermutation, it was suggested that AID needs the binding of a 
co-factor to start switch recombination (Ta et al., 2003; Barreto et al, 2003). 

Analysis of DT40 knock-out mutants indicates that the RAD54 gene QBezzubova et al., 1997) and 
other members of the RAD52 recombination repair pathway are needed for efficient Ig gene 
conversion (Sale et al, 2001). Disruption of RAD51 analogues and paralogues reduces Ig gene 
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conversion and induces hypermutation in the rearranged light chain gene (Sale et al., 2001) 
suggesting that a defect in DNA repair by homologous recombination can shift Ig gene conversion 
to hypermutation. 

5 Recently, first cell systems have been developed which exploit the phenomenon of somatic 
hypermutation in the immunoglobulin locus to generate mutants of a target gene in constitutive 
and directed manner. These cell systems allow to prepare a gene product having a desired activity 
by cyclical steps of mutation generation and selection. Thus, WO 00/22111 and WO 02/100998 
describe a human Burkitt lymphoma cell line (Ramos) which is capable of directed constitutive 
10 hypermutation of a specific nucleic acid region. This mutated region can be the endogenous 

rearranged V segment or an exogenous gene operatively linked to control sequences which direct 
hypermutation. A significant disadvantage of this cell system is that human cells cannot be 
efficiently genetically manipulated by targeted integration, since transfected constructs insert 
primarily at random chromosomal positions. 

15 

WO 02/100998 also describes another cell system for generating genetic diversity in the Ig locus 
which is based on the chicken B cell line DT40. DT40 continues gene conversion of the 
rearranged light chain immunoglobulin gene during cell culture (Buerstedde et al., 1990). 
-Importantly, this cell line has a high ratio of targeted to random integration of transfected 

20 constructs thus allowing efficient genetic manipulation (Buerstedde and Takeda, 1991). According 
to WO 02/100998, deletion in DT40 of the paralogues of the RAD5I g;ene which are involved in 
homologous recombination and DNA repair led to a decrease in gene conversion and a 
simultaneous activation of hypermutation of the rearranged V segment. However, the main 
disadvantage of this system is that the mutant cells have a DNA repair deficiency as reflected by 

25 X-ray sensitivity and chromosomal instability. The mutants also have a low proliferation rate and a 
low gene targeting efficiency. Therefore this system is poorly suited for efficient gene 
diversification and selection. 

The present invention overcomes the disadvantages of the prior art systems and provides further 
30 advantages as well. 

SUMMARY OF THE INVENTION 

In the first aspect of the invention there is provided a genetically modified lymphoid cell having 
35 gene conversion fully or partially replaced by hypermutation, wherein said cell has no deleterious 
mutations in genes encoding paralogues and analogues of the RAD51 gene which encode 
important homologous recombination factors. Specifically, the cell contains wild-type homologous 
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recombination factors. Due to the intact homologous recombination machinery, the cell according 
to the invention is recombination and repair proficient and has a normal proliferation rate. 

The cell of the invention is an immunoglobulin-expressing B lymphocyte derived from animal 
species which use the mechanism of gene conversion for developing their irnmunoglobulin 
repertoire. These species are for example chicken, sheep, cow, pig and rabbit. Preferably, the cell 
is derived from a chicken Bursal lymphoma. Most preferably, the cell is derived from or related to 
the DT40 cell line. 

In a further embodiment, the cell according to the invention is capable of directed and selective 
genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. The target nucleic acid may encode a protein or possess a 
regulatory activity. Examples of proteins are an immunoglobulin chain, a selection marker, a 
DNA-binding protein, an en2yme, a receptor protein or a part thereof. In a preferred embodiment, 
the target nucleic acid is the V(D) J segment of a rearranged human immunoglobulin gene. 
Examples of regulatory nucleic acids are a transcription regulatory element or a KNAi sequence. 

In an embodiment, in which the target nucleic acid is diversified by a combination of 
hypermutation and gene conversion, the cell according to the invention contains at least one 
sequence capable of serving as a gene conversion donor for the target nucleic acid. 

In a further embodiment, the target nucleic acid is an exogenous nucleic acid operably linked to 
control nucleic acid sequences that direct genetic diversification. 

In an additional embodiment, the target nucleic acid is expressed in the cell according to the 
invention in a manner that facilitates selection of cells which exhibit a desired activity. The 
selection can be a direct selection for the activity of the target nucleic acid within the cell, on the 
cell surface or outside the cell. Alternatively, the selection can be an indirect selection for the 
activity of a reporter nucleic acid. 

In a further embodiment, the invention provides for genetic means to modulate the genetic 
diversification of the target nucleic acid in the cell according to the invention. The modulation can 
be by modification of cis-acting regulatory sequences, by varying the number of gene conversion 
donors, or by modification of trans-acting regulatory factors such as activation-induced deaminase 
(AID) or a DNA repair or recombination factor other than a RAD 51 analogue or paralogue. The 
cell preferably expresses activation-induced deaminase (AID) conditionally. 
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In a second aspect, there is provided a cell line derived from a cell according to the invention. In a 
preferred embodiment, the cell line is DT40 or a modification thereof. 

In a third aspect, there is provided a transgenic non-human animal containing a lymphoid cell 
5 having gene conversion fully or partially replaced by hypermutation, wherein said cell has no 
deleterious mutations in genes encoding paralogues and analogues of the RAD51 protein, and 
wherein said cell is capable of directed and selective genetic diversification of a transgenic target 
nucleic acid by hypermutation or a combination of hypermutation and gene conversion. In a 
preferred embodiment, the animal is chicken. 

10 

In a further aspect, the invention provides a method for preparing a cell capable of directed and 
selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. The method comprises (a) transfecting a lymphoid cell 
capable of gene conversion with a genetic construct containing the target nucleic acid, and (b) 
15 identifying a cell having the endogenous V-gene segment of a part thereof replaced with the target 
nucleic acid. 

According to a further embodiment, the genetic construct containing the target nucleic acid further 
contains at least one nucleic acid capable of serving as a gene conversion donor for the target 
20 nucleic acid. The locus containing the target nucleic acid can be constructed by a single 

transfection or multiple rounds of transfection with constructs containing different components of* 
the locus. 

In the embodiment, in which selection for a cell with a desired activity is indirect, the method of 
25 the invention further comprises (c) transfecting the cell from step (b) with a further genetic 
construct comprising a reporter gene capable of being influenced by the target nucleic acid. 

In a further embodiment, the method of the invention farther comprises (d) conditional expressiorx 
of a trans-acting regulatory factor. In a preferred embodiment, the trans-acting regulatory factor is 
30 activation-induced deaminase (AID). 

According to a particularly preferred embodiment, the target nucleic acid is inserted into the cell 
by targeted integration. 

35 In a further aspect, there is provided a method for preparing a gene product having a desired 

activity, comprising the steps of: (a) culturing cells according to the invention under appropriate 
conditions to express the target nucleic acid, (b) identifying a cell or cells within the population of 
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cells which expresses a mutated gene product having the desired activity; and (c) establishing one 
or more clonal populations of cells from the cell or cells identified in step (b), and selecting from 
said clonal populations a cell or cells which expresses a gene product having an improved desired 
activity. 

5 

In one embodiment, steps (b) and (c) are iteratively repeated until a gene product with an 
optimized desired activity is produced. 

According to a further embodiment, the genetic diversification can be switched off, for example, 
10 by down-regulation of the expression of a trans-acting regulatory factor, when the cell producing a 
gene product with an optimized desired activity has been identified. The trans-acting regulatory 
factor can be, for example, activation-induced deaminase (AID) or a factor involved in 
homologous recombination or DNA repair, other than a RAD51 paralogue or analogue. 

15 In another embodiment, the diversification of the target nucleic acid is further modified by target 
sequence optimization such as the introduction of Ig hypermutation hotspots or an increased GC 
content. 

In a further aspect of the present invention, there is provided the use of a cell capable of directed 
20 and selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion for the preparation of a gene product having a desired activity. 

DESCRIPTION OF THE FIGURES 

25 Fig. 1 M/V gene deletion (A) A physical map of the chicken rearranged Ig light chain locus 

and the \j/V knock-out constructs. The locus contains a total of 25 \|/V genes upstream of 
functional V segment. The knock-out strategy of yV genes by the targeted integration of the 
p\{/VDell-25 and the p\j/VDe!3-25 constructs is shown below. Only the relevant EcoRI sites are 
indicated. (B) Southern blot analysis of wild-type and knock-out clones using the probe shown in 

30 (A) after EcoRI digestion. The wild-type locus hybridizes as a 12-kb fragment, whereas 

yypartiai and yv-i 0c i hybridize as a 7.4-kb and 6.3-kb fragment, respectively. (C) AID status. The 
AID gene was amplified by PGR to verify the presence or absence of AID cDNA expression 
cassette. 

35 Fig. 2 slgM expression analysis of control and \|/V knock-out clones (A) FACS anti-IgM 

staining profiles of representative subclones derived from initially sIgM(+) clones. (B) Average 
percentages of events falling into slgM(-) gates based on the measurement of 24 subclones. 



WO 2005/080552 



PCT/EP2005/001897 



Fig. 3 Ig light chain sequence analysis of the \\rV knock-out clones Mutation profiles of 
the AIDVV" and AK>Vv partial clones. All nucleotide substitutions identified in different 
sequences in the region from the leader sequence to the J-C intron are mapped onto the rearranged 
5 light chain sequence present in the AID R precursor clone. Mutations of the AID R \|/V" and the 
AID R \|/V partJal clones are shown above and below the reference sequenoe, respectively. Deletions, 
insertions and gene conversion events are also indicated. Hotspot motLfs (RGYW and its 
complement WRCY) are highlighted by bold letters. 

10 Fig. 4 Mutation profiles of hypermutating cell lines (A) Percentages of sequences 

carrying a certain number of mutations. Each untemplated nucleotide substitution is counted, but 
gene conversion, deletions and insertions involving multiple nucleotides are counted as a single 
event. PM, point mutation; GC, gene conversion; D, deletion; I, insertion. (B) Pattern of nucleotide 
substitutions within sequences from \|/V and the XRCC3 knock-out clones. Nucleotide 

15 substitutions as part of gene conversion events are excluded. The ratios of transition (trs) to 

transversion (trv) are also shown. (C) Hotspot preference of untemplated nucleotide substitution 
mutations. Mutations occurring within a hotspot motif (RGYW or its complement WRCY) are 
shown by percentages. (D) Trypan-blue positive cells as an indicator of spontaneously dying cells. 

20 Fig. 5 Distribution of nucleotide substitutions within genomic sequences from unsorted 

AED R \|/V" cells and within cDNA sequences from sorted IgM (-) AIO R ij/V~ cells The number of 
mutations are counted for every 50 bp, and are shown together with the corresponding physical 
maps of the light chain genomic locus or the cDNA sequence. 

25 Fig. 6 A model explaining the regulation of Ig gene conversion* and Ig hyper mutation 

Fig. 7 In situ mutagenesis of the GFP gene (A) Ig VJ replacement vector. (B) in vivo 
mutagenesis of the GFP gene by hypermutation. (C) \|/V donor replacement vector. (D) in vivo 
mutagenesis of GFP gene by gene conversion and hypermutation. 

30 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention makes available a particularly useful cell system f?or directed and selective 
genetic diversification of any nucleic acid by hypermutation or a combination of hypermutation 
35 and gene conversion. The system is based on B cell lines which constitu-tively diversify the 
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rearranged immunoglobulin V-gene in vitro without requiring extracellular stimuli such as an 
interaction with other cells or molecules or maintenance of the B cell antigen receptor. 

As used herein, "directed and selective diversification" refers to the ability of certain cells to cause 
5 alteration of the nucleic acid sequence of a specific region of endogenous or transgenic nucleic 
acid, whereby sequences outside of these regions are not subjected to mutation. 

"Genetic diversification" refers to alteration of individual nucleotides or stretches of nucleotides in 
a nucleic acid. Genetic diversification in the cells according to the invention occurs by 
10 hypermutation, gene conversion or a combination of hyp emulation and gene conversion. 

"Hypermutation" refers to the mutation of a nucleic acid in a cell at a rate above background. 
Preferably, hypermutation refers to a rate of mutation of between 1CT 5 and 10" 3 bp" 1 generation' 1 . 
This is greatly in excess of background mutation rates, which are of order of 10" 9 to 10" 10 
15 mutations bp" 1 generation" 1 (Drake et ak 1988) and of spontaneous mutations observed in PGR. 
Thirty cycles of amplification with Pfu polymerase would produce <0.05xl0~ 3 mutations bp" 1 in 
the product, which in the present case would account for less than 1 in 100 of the observed 
mutations (Lundberg et aL, 1991). 

20 "Gene conversion" refers to a phenomenon in which sequence information is transferred in 

unidirectional manner from one homologous allele to the other. Gene conversion may be the result 
of a DNA polymerase switching templates and copying from a homologous sequence, or the result 
of mismatch repair (nucleotides being removed from one strand and replaced by repair synthesis 
using the other strand) after the formation of a heteroduplex. 

25 

Hypermutation and gene conversion generate natural diversity within the immunoglobulin V(D) J 
segment of B cells. Hypermutation takes place in the germinal centers of such species as mouse 
and human following antigen stimulation. Gene conversion takes place in primary lymphoid 
organs like the Bursa of Fabricius or the gut-associated lymphoid tissue in such species as chicken, 
30 cow, rabbit, sheep and pig independent of antigen stimulation. In chicken, stretches from the 
upstream pseudo-V-genes are transferred into the rearranged V(D) J segment. According to the 
present invention, therefore, the cell or cell line is preferably an immunoglobulin-producing cell or 
cell line which is capable of diversifying its rearranged immunoglobulin genes. 

35 A direct connection between the initiation of hypermutation and gene conversion is for the first 
time established in the experiments reported herein. Specifically, partial or complete deletion of 
pseudo-V-genes in a cell line which continues gene conversion in cell culture leads to the 
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activation of hypermutation in the immunoglobulin locus. Deletion of all pseudogenes results in 
the abolishment of gene conversion and simultaneous activation of high rates of hypermutation, 
whereas deletion of a few pseudogenes results in the down-regulation of gene conversion and 
simultaneous activation of hypermutation at rates lower than the ones observed for the complete 
5 pseudogene deletion. Therefore, the number of available pseudogene donors directly correlates 
with gene conversion rates and inversely correlates with hypermutation rates. Gene conversion and 
hypermutation are established to be in a reciprocal relationship to each other. Thus, the present 
invention for the first time provides a cell system which allows to genetically diversify a target 
nucleic acid by a combination of hypermutation and gene conversion, whereby the contribution of 
10 these two phenomena can be regulated by changing the number of the gene conversion donors, 
their orientation or their degree or length of homology. 

An advantage of the cell system according to the invention over a cell system with only 
hypermutating activity such as the one based on the human Burkitt lymphoma cell line Ramos 

1 5 (WO 00/22 1 1 1 and WO 02/100998) is the ability to combine genetic diversification by 

hypermutation and gene conversion in one cell. For example, more defined changes can be 
introduced into the target gene by gene conversion than by random hypermutation, since gene 
conversion donors can be engineered to contain sequences likely to influence the target nucleic 
acid activity in a favorable way. Gene conversion and hypermutation might thus increase the 

20 chance to produce desirable variants, since pre-tested sequence blocks are combined with random 
hypermutations. Pseudogenes with sequences identical to a certain region of the target gene can 
also be used to keep a part of the target nucleic acid stable by frequent conversions having the 
effect that the hypermutations persist only in the non-converting part. This approach is useful 
when the target nucleic acid contains region which should remain stable for optimal activity. 

25 

An advantage of the cell system according to the invention over a cell system based on the 
suppression of homologous recombination activity in gene conversion active cells (WO 
02/100998) is genetic stability of the cell reflected in a normal proliferation rate, radiation 
resistance and DNA repair competence. 

30 

A particular advantage of the present cell system over all known systems is the ability of the cells 
according to the invention to integrate transfected nucleic acid constructs by targeted integration 
into the homologous endogenous locus. 

35 "Targeted integration" is integration of a transfected nucleic acid construct comprising a nucleic 
acid sequence homologous to an endogenous nucleic acid sequence by homologous recombination 
into the endogenous locus. Targeted integration allows to directly insert any nucleic acid into a 
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defined chromosomal position. In a preferred embodiment, a nucleic acid encoding a gene product: 
of interest is inserted by targeted integration into the immunoglobulin locus in place of the 
- rearranged V(D) J segment or a portion thereof. 

5 In a preferred embodiment, the cells according to the invention are derived or related to cells 

which undergo Ig gene conversion in vivo. Cells which undergo Ig gene conversion in vivo are, for" 
example, surface Ig expressing B cells in primary lymphoid organs such as avian Bursal B cells. 
Lymphoma cells, derived from B cells of primary lymphoid organs, are particularly good 
candidates for constructing cells and cell lines according to the present invention. In the most 
10 preferred embodiment, the cells are derived from a chicken Bursal lymphoma cell line DT40. 

The process of constitutive genetic diversification by hypermutation and gene conversion is used 
in the present invention to produce gene products with a desired, novel or improved, activity. 

15 A "target nucleic acid" is a nucleic acid sequence or chromosomal region in the cell according to 
the present invention which is subjected to direct and selective genetic diversification. The target 
nucleic acid can be either endogenous or transgenic and may comprise one or more transcription 
units encoding gene products. 

20 As used herein, a "transgene" is a nucleic acid molecule which is inserted into a cell, such as by 
transfection or transduction. For example, a transgene may comprise a heterologous transcription 
unit which may be inserted into the genome of a cell at a desired location. 

In one embodiment, transgenes are immunoglobulin V-genes as found in immunoglobulin- 
25 producing cells or fragments of V-genes. Preferably, the target nucleic acid is a human 

immunoglobulin V-gene. In this case, the cells according to the invention are "factories" of human 
antibody variants capable of binding to any given antigen. 

Alternatively, the target nucleic acid is a non-immunoglobulin nucleic acid, for example a gene 
30 encoding selection markers, DNA-binding proteins, enzymes or receptor proteins. For example, a 
novel fluorescent selection marker can be produced by mutating a known marker by 
hypermutation or by a combination of hypermutation and gene conversion with help of other 
known markers with a different fluorescent spectrum serving as gene conversion donors. 

35 In one embodiment of the invention, the target nucleic acid directly encodes a gene product of 

interest. Gene diversification of such a nucleic acid will result in a truncation of the encoded gene 
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product or in a change of its primary sequence. With every round of diversification and selection, a 
cell expressing the gene product with an improved activity is search for. 

Alternatively, the target nucleic acid is a regulatory element, for example, a transcription 
5 regulatory element such as promoter or enhancer, or interfering RNA (RNAi). In this embodiment, 
an additional nucleic acid (reporter gene) which is influenced by the target nucleic acid and 
encodes an identifiable gene product is required to identify cells bearing the target nucleic acid of 
interest. 

1 0 In the embodiment, in which genetic diversification of the target nucleic acid takes place by a 

combination of hypermutation and gene conversion, additional nucleic acids capable of serving as 
gene conversion donors are inserted into the cell genome, preferably upstream of the target nucleic 
acid. 

15 A "nucleic acids capable of serving as a gene conversion donor" is a nucleic acid having a 

sequence homologous to the target nucleic acid. Examples of natural gene conversion donors are 
pseudo-V-genes in the immunoglobulin locus of certain species. 

According to one embodiment of the invention, a cell capable of directed and selective 
diversification of the target nucleic acid is constructed by inserting the target nucleic acid into the 
host cell by targeted integration at a defined chromosomal site. For this purpose, the transfected 
constructs may contain upstream and downstream of the target nucleic acid sequences homologous 
to the desired chromosomal integration site. Preferably, the cell is constructed by replacing the 
endogenous V-gene or segments thereof with a transgene by homologous recombination, or by 
gene targeting, such that the transgene becomes a target for the gene conversion and/or 
hypermutation events. 

In another embodiment, transgenes according to the invention also comprise sequences which 
direct hypermutation and/or gene conversion. Thus, an entire locus capable of expressing a gene 
product and directing hypermutation and gene conversion to this transcription unit is transferred 
into the cells and is actively diversified even after random chromosomal integration. 

Screening of clones having incorporated the transgene by targeted integration can be done by 
Southern blot analysis or by PGR. 
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In a preferred embodiment, transgenes according to the invention contain a selectable marker gene 
which allows selection of clones which have stablely integrated the transgene. This selectable 
marker gene may subsequently be removed by recombination or inactivated by other means. 

The present invention further provides a method for preparing a gene product having a desired 
activity by repeated rounds of cell expansion and selection for cells bearing a target nucleic acid 
with a desired activity. As used herein, "selection" refers to the determination of the presence of 
sequence alterations in the target nucleic acid which result in a desired activity of the gene product 
encoded by the target nucleic acid or in a desired activity of the regulatory function of the target 
nucleic acid. 

The process of gene conversion and hypermutation is employed in vivo to generate improved or 
novel binding specificities in immunoglobulin molecules. Thus, by selecting cells according to the 
invention which produce immunoglobulins capable of binding to the desired antigen and then 
propagating these cells in order to allow the generation of further mutants, cells which express 
immunoglobulins having improved binding to the desired antigen may be isolated. 

A variety of selection procedures may be applied for the isolation of mutants having a desired 
specificity. These include Fluorescence Activated Cell Sorting (FACS), cell separation using 
magnetic particles, antigen chromatography methods and other cell separation techniques such as 
use of polystyrene beads. 

Fluorescence Activated Cell Sorting (FACS) can be used to isolate cells on the basis of their 
differing surface molecules, for example surface displayed immunoglobulins. Cells in the sample 
or population to be sorted are stained with specific fluorescent reagents which bind to the cell 
surface molecules. These reagents would be the antigen(s) of interest linked (either directly or 
indirectly) to fluorescent markers such as fluorescein, Texas Red, malachite green, green 
fluorescent protein (GFP), or any other fluorophore known to those skilled in the art. The cell 
population is then introduced into the vibrating flow chamber of the FACS machine. The cell 
stream passing out of the chamber is encased in a sheath of buffer fluid such as PBS (Phosphate 
Buffered Saline). The stream is illuminated by laser light and each cell is measured for 
fluorescence, indicating binding of the fluorescent labeled antigen. The vibration in the cell stream 
causes it to breakup into droplets, which carry a small electrical charge. These droplets can be 
steered by electric deflection plates under computer control to collect different cell populations 
according to their affinity for the fluorescent labeled antigen. In this manner, cell populations 
which exhibit different affinities for the antigen(s) of interest can be easily separated from those 
cells which do not bind the antigen. FACS machines and reagents for use in FACS are widely 
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available from sources world-wide such as Becton-Dickinson, or from service providers such as 
Arizona Research Laboratories (http://www.arl.arizoria.edu/facs/). 

Another method which can be used to separate populations of cells according to the affinity of 
5 their cell surface protein(s) for a particular antigen is affinity chromatography. In this method, a 
suitable resin (for example CL-600 Sepharose, Pharmacia Inc.) is covalently linked to the 
appropriate antigen. This resin is packed into a column, and the mixed population of cells is passed 
over the column. After a suitable period of incubation (for example 20 minutes), unbound cells are 
washed away using (for example) PBS buffer. This leaves only that subset of cells expressing 

10 immunoglobulins which bound the antigen(s) of interest, and these cells are then eluted from the 
column using (for example) an excess of the antigen of interest, or by enzymatically or chemically 
cleaving the antigen from the resin. This may be done using a specific protease such as factor X, 
thrombin, or other specific protease known to those skilled in the art to cleave the antigen from the 
column via an appropriate cleavage site which has previously been incorporated into the antigen- 

15 resin complex. Alternatively, a non-specific protease, for example trypsin, may be employed to 
remove the antigen from the resin, thereby releasing that population of cells which exhibited 
affinity for the antigen of interest. 

The present invention provided for the first time a mechanism which allows to regulate genetic 
20 diversification of the target nucleic acid. As demonstrated by the present inventors, activation- 
induced deaminase (AID) is a factor which regulates gene conversion as well as hypermutation in 
the immunoglobulin locus. It is suggested that AID induces a common modification in the 
rearranged V(D) J segment leading to a conversion tract in the presence of adjacent donor 
sequences and to a point mutation in their absence. Therefore, by regulation of AID expression, 
25 both phenomena can be modulated. In a preferred embodiment, the AID gene is transiently 

expressed in the cell containing a target nucleic acid. For example, AID can be expressed under a 
drug-responsive promoter such as the tetracycline responsible gene expression system. Otherwise 
the gene expression may be shut down by the excision of the AID expression cassette by induced 
recombination. Switching off the AID expression will prevent further diversification of the target 
30 sequence. Preferably, AID expression is switched off in the cell producing a gene product with a 
desired activity in order to prevent further mutations which can lead to the loss of the desired 
activity. 

The invention is illustrated by the following examples. 

35 
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EXAMPLES 



1. AID initiates immunoglobulin gene conversion and hypermutation by a 
common intermediate 

5 

Herein it is reported that ablation of \|/V donors activates AID-dependent Ig hypermutation in 
chicken B cell line DT40. This shows that Ig gene conversion and hypermutation are competing 
pathways derived from the same AID-initiated intermediate. Furthermore \}/V knock-out DT40 is 
proposed as an ideal model system to approach the molecular mechanism of Ig hypermutation and 
10 as a new tool for in situ mutagenesis. 

Methods 



Cell lines. DT40 Crel which displays increased Ig gene conversion due to a v-myb transgene 
1 5 and contains a tamoxifen inducible Cre recombinase has been described previously (Arakawa et 
al. 5 2001). DT40 Crel AID" / " was generated by the targeted disruption of both AID alleles of DT40 Crel 
(Arakawa et al., 2002). AID R was derived from DT40 Crel AJD' A after stable integration of a floxed 
AID-IRES-GFP bicistronic cassette, in which both AID and GFP are expressed from the same |3- 
actin promoter. AIDNj/V was derived from AID R by transfection of pyVDell-25 (Fig. 1 A). Stable 
20 transfectants which had integrated the construct into the rearranged light chain locus were then 

identified by locus specific PGR. Targeted integration of pv|/VDell-25 results in the deletion of the 
entire \)/V gene loci starting 0.4 kb upstream of \j/V25 and ending one bp downstream of \j/Vl. 
AJD R \j/V partial was produced in a similar way as AID R \|/V" by transfection of p\j/VDel3-25 which 
upon targeted integration leads to a partial deletion of the \|/V loci starting 0.4 kb upstream of 
25 \yV25 and ending one bp downstream of \|/V3. Cell culture and electroporation were performed as 
previously described (Arakawa et al, 2002). XRCC3 7 " was derived from DT40 Crel by deleting 
amino acids 72 - 170 of XRCC3 gene following transfection of XRCC3 knock-out constructs. 
Clones having undergone targeted integration were initially identified by long-range PGR and the 
XRCC3 deletion was then confirmed by Southern blot analysis. 

30 

Ig reversion assay. Subcloning, antibody staining, flow cytometry and quantification of 
slgM expression has been described previously (Arakawa et al., 2002). All clones used in the 
study were sIgM(+) due to the repair of the light chain frameshift of the original Cll8(-) variant 
(Buerstedde et al, 1990) by a gene conversion event. 



35 



WO 2005/080552 



PCT/EP2005/001897 



PGR. To minimize PCR-introduced artificial mutations, PfuUltra hotstart polymerase 
(Stratagene) was used for amplification prior to sequencing. Long-range PGR, RT-PCR and Ig 
light chain sequencing were performed as previously described (Arakawa et al., 2002). The 
promoter and J-C intron region of Ig light chain plasmid clones were sequenced using the Ml 3 
5 forward and reverse primers. Bu-1 and EFla genes were amplified using BU1/BU2 (BUI, 
GGGAAGCTTGATCATTTCCTGAATGCTATATTCA; BU2, 
GGGTCTAGAAACTCCTAGGGGAAACTTTGCTGAG) and EF6/EF8 (EF6, 
GGGAAGCTTCGGAAGAAAGAAGCTAAAGACCATC; EF8, 

GGGGCTAGCAGAAGAGCGTGCTCACGGGTCTGCC) primer pairs, respectively. The PGR 
10 products of these genes were cloned into the pBluescript plasmid vector, and were sequenced 
using the M13 reverse primer. 

Results 

15 Targeted deletion of vj/V donor sequences in the rearranged light chain locus 

Two \|/V knock-out constructs were made by cloning genomic sequences, which flank the 
intended deletion end points, upstream and downstream of a floxed-gpt (guanine phosphoribosyl 
transferase) cassette (Arakawa et al., 2001). Upon targeted integration, the first construct, 
p\j/VDell-25, deletes all pseudogenes (v|/V25 to yVl) whereas the second construct, p\|/VDel3 -25, 

20 deletes most pseudogenes (\j/V25 to \|/V3) (Fig. 1 A). A surface IgM positive (sIgM(+)) clone, 

derived from DT40 Crel AID' /_ cells (Arakawa et al., 2002) by transfection and stable integration of a 
floxed AID-IRES (internal ribosome entry site) -GFP transgene, was chosen for the transfection of 
the \]/V knock-out constructs. This AID reconstituted clone, named A1D R has the advantage that 
the appearance of deleterious Ig light chain mutations can be easily detected by the loss of slgM 

25 expression and that GFP-marked AID expression can be shut down after tamoxifen induction of 
the Cre recombinase transgene inherited from DT40 Crel (Arakawa et al., 2002). 

Following transfection of the \j/V knock-out constructs into the AID R clone, mycophenolic 
acid resistant clones containing targeted deletions of the rearranged light chain locus were 
identified. These primary vj/V knock-out clones contain two floxed transgenes, the inserted gpt 

30 marker gene in the rearranged light chain locus and the AID-IRES-GFP gene of the AID R 
progenitor clone. Since the gpt gene might perturb the adjacent transcription or chromatin 
configuration, the primary \|/V knock-outs were exposed to a low concentration of tamoxifen and 
then subcloned by limited dilution. In this way, secondary \|/V knock-out clones could be isolated 
which had either deleted only the gpt gene (AID R i|/V~ and AID R \|/V partiaI ) or the gpt gene together 

35 with the AID-IRES-GFP gene (AID^\|/V and AID" ; "\}/V partial ). The disruption of \j/V genes in the 
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rearranged light chain locus and the excision of AID over-expression cassette were confirmed by 
Southern blot analysis (Fig. IB) and PGR (Fig. 1C), respectively. 

Increased loss of slgM expression after deletion of \j/V genes in AID positive clones 

To estimate the rates of deleterious Ig mutations, slg expression was measured by FAGS 
after two weeks culture for 24 subclones each of the DT40 Crel , AID R , DT40 Crel AID" 7 " and \|/V 
knock-out clones (Figs. 2A and 2B). Analysis of the controls with the intact \]/V locus revealed an 
average of 0.52% and 2.27% slgM(-) cells for the DT40 Crel and AJD R subclones respectively, but 
only 0.08% for the DT40 Crel AID v ". Previous analysis of spontaneously arising sIgM(-) DT40 
variants demonstrated that about a third contained frameshift mutations in the rearranged light 
chain V segment which were regarded as byproducts of the Ig gene conversion activity 
(Buerstedde et al. a 1990). This view is now supported by the finding that the AID negative 
DT40 Crel AID"'" clone, which should have lost the Ig gene conversion activity, stably remains 
sIgM(+). Most interestingly, subclones of the AID positive \|/V knock-out clones (AID\V partial 
and AIDNiAT) rapidly accumulate slgM(-) populations whereas subclones of the AID negative \|/V 
knock-out clones (AID"Vv partiaI and AID V >V") remain sIgM(+) (Figs. 2A and 2B). This suggests 
that the deletion of the pseudogenes dramatically increases the rate of deleterious light chain 
mutations in AID expressing cells. 

Replacement of Ig gene conversion by hyper mutation in the absence of \j/V donors 

To analyze the newly identified mutation activity, the rearranged light chain VJ segments of 
the \|/V knock-out clones were sequenced 5-6 weeks after subcloning. A total of 135 nucleotide 
changes (Fig. 4A, Table 1) were found in the 0.5 kb region between the V leader and the 5' end of 
the J-C intron within 95 sequences from the AIDNj/V" clone (Fig. 3, above reference sequence). In 
contrast to the conversion tracts seen in wild-type DT40 cells, almost all changes are single base 
substitutions and apart from a few short deletions and di-nucleotide changes, mutation clusters 
were not observed. The lack of conversion events in AID R \j/V", which still contains the \]/V genes 
of the unrearranged light chain locus, confirms that Ig gene conversion only recruits the \j/V genes 
on the same chromosome for the diversification of the rearranged light chain gene (Carlson et aL, 
1990). No sequence diversity was found in a collection of 95 light chain gene sequences from the 

AID~ f '\]fV clone (Fig. 4A, Table 1), indicating that AID is required for the mutation activity. 

Sequences derived from the AID R \}/V partiaI clone occasionally display stretches of mutations 

which can be accounted for by the remaining \|/V1 and \}/V2 (Fig. 3, below reference sequence). 

Nevertheless, the majority of AID R \|/V partial mutations are single untemplated base substitutions as 

seen with the AID R \|/V" cells (Fig. 4A, Table 1). Only 3 base substitutions, which possibly are PGR 
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artifacts, were found in 92 sequences of the AID^>V pattial clone confirming that both the gene 
conversion and the mutation activities of AID^f V partial are AID dependent. 

The new mutation activity of the \|/V knock-out clones closely resembles somatic 
5 hypermutation 

The discovered Ig mutation activity in the v/V knock-out clones with a predominance of 
single nucleotide substitutions suggests that somatic hypermutation had replaced Ig gene 
conversion. There is however a difference between the nucleotide substitutions in the AID VV partial 
and AID R \|/ V " clones and Ig hypermutations in germinal center B cells in that the clones show very 

10 few mutations in A/T bases and a preference for transversion mutations (Fig. 4B). 

Ig hypermutations are typically localized within one kb of the transcribed gene sequence 
with preferences for the Complementary Determining Regions (CDRs) of the V(D)J segments, 
whereas no or few mutations are present in the downstream C region (Lebecque and Gearhart, 
1990). To investigate whether the mutations in the AID R \|/V" clone follow a similar distribution, 

1 5 sequence analysis was extended to the promoter region and the J-C intron of the rearranged light 
chain gene (Fig. 5). Although mutations are found close to the promoter and in the intron 
downstream of the J segments, the peak incidence clearly coincides with the CDR1 and CDR3, 
which are also preferred sites of gene conversion in DT40 (unpublished results). Approximately 
half of all point mutations fall within the RGYW (R » A/G; Y = C/T; W = A/T) sequence motif or 

20 its complement WRCY (Fig. 4C), known as hot spots of Ig hypermutation in humans and mice. 

It was previously reported that the deletion of RAD51 paralogues induces Ig hypermutation 
in DT40 (Sale et al., 2001). To compare the hypermutation activity in the \}/V gene negative and 
RAD51 paralogue negative backgrounds, the XRCC3 gene was disrupted in the DT40 Crel clone 

25 and the rearranged VJ genes were sequenced 6 weeks after subcloning. Similar to the mutation 

spectrum in the ATCAf/V' clone and what was previously reported (Sale et al., 2001), the mutations 
in the sequences from the XRCC3" 7 " cells show a transversion preference and an absence of 
mutations in A/T bases (Fig. 4B). Nevertheless the mutation rate in the XRCC3 mutant was about 
2.5 fold lower than in the AID R \{A r clone and there was a clear slow growth phenotype of the 

3 0 XRCC3 mutant compared to wild-type DT40 and the AID \V clone (Fig. 4D). 

To identify the mutations responsible for the loss of slgM expression in the AID^/V" clone, 
94 light chain cDNAs from sorted slgM(-) cells were amplified and sequenced. Although one 
short insertion and five deletions were detected in this collection (Table 1), 89% of the 245 total 
35 mutations are single nucleotide substitutions within the VJ segments (Fig. 5). Surprisingly, only 
about 10% of the sequences contained a stop codon or a frameshift, suggesting that the lack of 
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sIgM(-) expression is mainly caused by amino acid substitutions which affect the pairing of the Ig 
light and heavy chain proteins. 

Ig locus specificity of hypermutation 

It has been reported that high AID expression in fibroblasts (Yoshikawa et aL, 2002) and B 
cell hybridomas (Martin and Scharff, 2002) leads to frequent mutations in Ixansfected transgenes. 
To rule out that the pseudogene deletions had induced a global hypermutator phenotype, the 5' 
ends of the genes encoding the B cell specific marker Bu-1 and the translation elongation factor 
EFlcc were sequenced for the AID^j/V" clone. Only a single one bp deletion was found within 95 
sequences of the Bu-1 gene and only two single nucleotide substitutions within 89 sequences of 
EFlcc (Table 1). As these changes most likely represent PCR artifacts, this further supports the 
view that the hypermutations induced by the \|/V deletions are Ig locus specific. 

Discussion 

The results demonstrate that the deletion of the nearby pseudogene donors abolishes Ig gene 
conversion in DT40 and activates a mutation activity which closely resembles Ig hypermutation. 
The features shared between the new activity and somatic hypermutation include 1) ATP 
dependence, 2) a predominance of single nucleotide substitutions, 3) distribution of the mutations 
within the 5' transcribed region, 4) a preference for hotspots and 5) Ig gene specificity. The only 
difference with regard to Ig hypermutation in vivo is the relative lack of mutations in A/T bases 
and a predominance of transversion mutations in the \\rV knock-out clones. However, this 
difference is also seen in hypermutating EBV transformed B cell lines (Bachl and Wabl, 1996; 
Faili et aL, 2002) and DT40 mutants of RAD51-paralogues (Sale et aL, 2001) indicating that part 
of the Ig hypermutator activity is missing in transformed B cell lines. Interestingly, the rate of Ig 
hypermutation in the AJD R v]/V clone seems higher than the rate of Ig gene conversion in the 
DT40 Crel progenitor. An explanation for this could be that some conversion tracts are limited to 
stretches of identical donor and target sequences and thus leave no trace. 

The induction of Ig hypermutation by the blockage of Ig gene conversions supports a simple 
model explaining how hypermutation and recombination is initiated and regulated (Fig. 6). At the 
top of the events is a modification of the rearranged V(D)J segment which is either directly or 
indirectly induced by AID. The default processing of this lesion in the absence of nearby donors or 
in the absence of high homologous recombination activity leads to Ig hypermutation in form of a 
single nucleotide substitution (Fig. 6, right side). However, if donor sequences are available, 
processing of the AID induced lesion can be divided into a stage before strand exchange, when a 
shift to Ig hypermutation is still possible and a stage after strand exchange when the commitment 
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toward Ig gene conversion has been made (Fig. 6, left side). Whereas completion of the first stage 
requires the participation of the RAD 51 paralogues, the second stage involves other recombination 
factors like the RAD 54 protein. 

This difference in commitment explains why disruptions of the RAD51 paralogues not only 
decrease Ig gene conversion, but also induce Ig hypermutation (Sale et al., 2001) whereas 
disruption of the RAD 5 4 gene only decreases Ig gene conversion (Bezzubova et al., 1997). The 
model also predicts that low cellular homologous recombination activity prevents Ig gene 
conversion even in the presence of conversion donors. Such a low homologous recombination 
activity might be the reason why human and murine B cells never use Ig gene conversion despite 
the presence of nearby candidate donors in form of unrearranged V segments and why chicken 
germinal center B cells shave shifted from Ig gene conversion to Ig hypermutation (Arakawa et al., 
1998). 

The AID R and the \}/V knock-out DT40 clones are a powerful experimental system to 
address the role of trans-acting factors and cis-acting regulatory sequences for Ig gene conversion 
and hypermutation. Compared to alternative animal or cell culture systems it offers the advantages 
of: 1) parallel analysis of Ig gene conversion and Ig hypermutation, 2) conditional AID expression, 
3) easy genome modifications by gene targeting, 4) normal cell proliferation and repair proficiency 
and 5) Ig locus specificity of hypermutation. The ability to induce gene specific hypermutation in 
the DT40 cell line might also find applications in biotechnology. One possibility is to replace the 
chicken antibody coding regions by their human counterparts and then to simulate antibody 
affinity maturation from a repertoire which continuously evolves by Ig hypermutation. 

2. Targeted in vivo mutagenesis of GFP by gene conversion and 
hypermutation 

The gene encoding Green Fluorescent Protein (GFP) is an example of a target nucleic acid which 
can be genetically diversified using the cell system of the invention, in particular the DT40 cell 
line. The GFP gene inserted into the Ig light chain locus by targeted integration will be subjected 
to hypermutation and its activity with respect to color, intensity and half-life will evolve with time 
(Fig. 7B). If a combination of hypermutation and gene conversion is used to modify the GFP 
activities, variant GFP sequences which can serve as gene conversion donors for GFP are also 
inserted into the Ig locus (Fig. 7D). 

An Ig VJ replacement vector, pVjRepBsr, which allows to replace the Ig light chain VJ gene by 
any nucleic acid target is depicted in Fig. 7A. A potential target for mutagenesis can be cloned into 
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Spel site, which is compatible with Xbal, Nhel, Avrll and Spel sites. For example, the GFP gene 
can be inserted into the Ig light chain locus by targeted integration using pVjRepBsr. A \j/V-gene 
donor replacement vector, pPseudoRepBsr, which allows to replace the Ig \)/V gene light chain 
locus by any nucleic acid target is depicted in Fig. 7C. Potential gene conversion donors can be 
5 cloned into either Nhel or Spel site, which is compatible with Xbal, Nhel, AvrIT and Spel sites. 
Because Nhel site is located between two loxPs, this site can be used for conditional knockout 
design. By stepwise targeted integration using pPseudoRepGpt and pVjRepBsr, \j/V genes can be 
replaced by \|/GFP gene and its variants (e.g. \j/CFP: cyano fluorescence protein and \|/YFP: yellow 
fluorescence protein) and the VJ gene can be replaced by GFP carrying a frameshift mutation 
10 (FsGFP) to monitor genetic diversification of the GFP gene. The frameshift in FsGFP is expected 
to be repaired by gene conversion of yGFP, yCFP and \|/YFP as templates. In addition, the gene 
will be further diversified by hypermutation. 
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